ABSTRACT

Template matching is widely used in image and signal processing and is usually time-critical. Traditional methods focus on reducing the number of search locations, either by a coarse-to-fine strategy or by full search combined with pruning. However, the computation cost of those methods is easily dominated by the signal size $N$ rather than the template size $K$. This paper proposes a fast probabilistic matching scheme based on cross-correlation that requires $O(N)$ additions and $O(K \log K)$ multiplications. The core idea is to first downsample the signal to size $O(K)$, so that all subsequent operations involve only downsampled signals. The probability of a successful match depends on the cross-correlation between the signal and the template. We give a sufficient condition for a successful match and prove that the probability is high for binary signals with $K^2/\log K > O(N)$. Experiments show that the proposed scheme is fast and efficient, and they support the theoretical results.

Index Terms: Circulant matrix, Cross-correlation, Subsampling, Template matching

1. INTRODUCTION 

Template matching is the task of searching for a given template in a given signal. It is widely used in many applications, including communication synchronization [1], quality control [2], compression [3], and object detection [4]. Given a template $t \in \mathbb{R}^K$ and a signal $x \in \mathbb{R}^N$, template matching solves

$\arg\max_{0 \le k < N} \mathrm{Sim}(x^k, t)$,    (1)

where $x^k = [(x)_k, \ldots, (x)_{N-1}, (x)_0, \ldots, (x)_{k-1}]$, $(x)_k$ represents the entry of $x$ at index $k$, and $\mathrm{Sim}(\cdot)$ denotes a similarity metric.

One main challenge in solving Eq. (1) is the high computation time, whereas many applications demand real-time operation and are energy-critical. Two factors influence the computation overhead. One is the similarity measure used, such as cross-correlation (CC) [5], normalized cross-correlation (NCC) [6]-[8], or sum of squared differences (SSD) [9]; the similarity measure influences matching performance as well. The other is the search strategy. For example, exhaustive search (full search), which checks $x^k$ from $k = 0$ to $k = N - 1$, is unacceptably slow.
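As a point of reference, exhaustive search with cross-correlation as the similarity measure can be sketched as follows. This is a minimal illustration, not the authors' code; the function names and the toy signal are assumptions, and indices wrap circularly as in Eq. (1).

```python
def cross_correlation_scores(x, t):
    """Score every shift k with sum_i t[i] * x[(k + i) mod N]."""
    n = len(x)
    return [sum(t[i] * x[(k + i) % n] for i in range(len(t))) for k in range(n)]


def exhaustive_match(x, t):
    """Full search: evaluate all N shifts and return the best one (O(N*K) work)."""
    scores = cross_correlation_scores(x, t)
    return max(range(len(x)), key=scores.__getitem__)


# Toy binary signal; the template is the length-3 window starting at index 3.
x = [1, -1, -1, 1, 1, 1, -1, 1]
t = x[3:6]
best = exhaustive_match(x, t)
```

Every one of the $N$ shifts costs $K$ multiply-adds here, which is the cost the remainder of the paper tries to avoid.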

Many techniques have been developed to overcome these difficulties:

• Coarse-to-fine strategy [10], [11]: First, a coarse search finds the downsampled template in the downsampled image, yielding a good match with less computation overhead. Then, a fine search is conducted in the original space, starting from the neighborhood of the best match found in the coarse search.

• Full search-equivalent strategy [12]-[15]: It employs rejection schemes in which the current search is terminated as soon as certain criteria are satisfied.

• Fast convolution by Fast Fourier Transform (FFT) [5]: For specific similarity measures such as CC and SSD, Eq. (1) requires convolving $x$ with $t$. By the convolution theorem, this operation can be done quickly in the frequency domain. In particular, [1] proposes using the sparse FFT (sFFT) [16] in place of the FFT to further reduce the computation cost.

In this paper, we propose a fast template matching method using a downsampled circulant matrix and cross-correlation. For image-based applications, the performance of cross-correlation is usually inferior to that of NCC or SSD. For binary signals such as CDMA codes in communications, however, it works well [1]. Our idea is to search for the match in the downsampled signal only; no search in the higher-resolution space is required. As a result, the computation overhead is related to the size of the downsampled signal and is far lower than that of other approaches. The crux of our method is the design of a circulant matrix that achieves fast template matching, and the method can be considered an extension of [1]. There are two main differences: (1) Our idea exploits the commutative property of circulant matrices instead of adopting sFFT, so the proposed scheme is simpler and faster than [1]. (2) The computation cost of our method benefits from a small template size, whereas that of [1] depends only on the signal size.



2. NOTATIONS 

We briefly introduce the notation used in this paper. Let $v \in \mathbb{R}^N$ be a vector and let $\mathrm{circ}(v)$ be the circulant matrix generated from the seed vector $v$. For example, $V = \mathrm{circ}(v)$, where the first row of $V$ is $[(v)_0, (v)_1, \ldots, (v)_{N-1}]$, the second row is $[(v)_{N-1}, (v)_0, \ldots, (v)_{N-2}]$, and the last row is $[(v)_1, (v)_2, \ldots, (v)_0]$. $0_{1 \times N}$ represents the $1 \times N$ zero vector. If $u \in \mathbb{R}^M$ and $v \in \mathbb{R}^N$, $[u\ v] \in \mathbb{R}^{M+N}$ denotes the vector obtained by concatenating $u$ and $v$. To simplify notation, indices are taken modulo $N$, i.e., $(x)_i = (x)_{i \bmod N}$.
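The circulant convention can be made concrete with a short sketch. It assumes that the first row of $\mathrm{circ}(v)$ is $v$ itself and that each later row is the previous row rotated right by one position; the helper names `circ` and `matvec` are illustrative only.

```python
def circ(v):
    """Circulant matrix: row k holds v rotated right by k positions."""
    n = len(v)
    return [[v[(j - k) % n] for j in range(n)] for k in range(n)]


def matvec(a, x):
    """Plain matrix-vector product for list-of-lists matrices."""
    return [sum(a_kj * x_j for a_kj, x_j in zip(row, x)) for row in a]


V = circ([1, 2, 3])  # rows: [1, 2, 3], [3, 1, 2], [2, 3, 1]

# With T = circ([t 0]), the product T x is the circular cross-correlation
# sum_i t[i] * x[(k + i) mod N] evaluated at every shift k.
t, x = [1, -1], [4, 0, 2, 7, 5]
T = circ(t + [0] * (len(x) - len(t)))
direct = [sum(t[i] * x[(k + i) % len(x)] for i in range(len(t)))
          for k in range(len(x))]
```

Under this convention the matrix product and the direct sum agree entry by entry, which is the property the following section relies on.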


3. PROPOSED METHOD 

Template matching based on cross-correlation is equivalent to solving

$m = \arg\max_k (Tx)_k$ with $(Tx)_k = \sum_{i=0}^{K-1} (t)_i (x)_{k+i}$,

where $T = \mathrm{circ}([t\ 0_{1 \times (N-K)}])$. Specifically, $(Tx)_k$ is the matching result at position $k$ of $x$, and $m$ is the ground truth with the maximum correlation $(Tx)_m$. The goal of the proposed method is to find, with fast computation, an index $\hat{m}$ that is expected to equal the ground truth $m$.

The key idea is to downsample the original signal $x$ via a sampling matrix $\Phi \in \mathbb{R}^{M \times N}$ into a low-dimensional signal $y$, i.e., $y = \Phi x$, and then find a matrix $\hat{T} \in \mathbb{R}^{M \times M}$ such that $\hat{T}y = \hat{T}\Phi x = \Phi T x$. Since $y$ has lower dimension than $x$, computing $\hat{T}y$ costs less than computing $Tx$. In other words, $\hat{T}y$ amounts to a fast convolution of $x$ with $T$. Moreover, $\Phi T x$ is a downsampled version of $Tx$, the vector of matching results based on cross-correlation. Nevertheless, downsampling (i.e., $\Phi T x$) has the side effect that the position with the maximum cross-correlation can no longer be read directly from $\hat{T}y$. To overcome this difficulty, we conduct template matching on at least two downsampled signals and then identify the ground truth by the Chinese Remainder Theorem (CRT).

The proposed algorithm is depicted in Algorithm 1, which is mainly composed of four operations:

1. Downsampling: Downsample the original signal $x$ into a low-dimensional signal $y$, where each entry of $y$ is the sum of $\lceil N/M \rceil$ entries in $x$ (Steps 2-3).

2. Fast convolution in low-dimensional space: Convolve $y$ with $t$ and obtain the result $r$ (Steps 4-6). By the commutative property of two circulant matrices, convolving $y$ with $t$ is equivalent to convolving $x$ with $t$ and then downsampling. Each entry of the convolution result $r$ is the sum of $\lceil N/M \rceil$ entries in $Tx$.

3. Matching position finding: If an entry in $r$ is large, it implies that one of the $\lceil N/M \rceil$ entries in $Tx$ is also large with high probability. Thus, searching for the maximum value in $r$ (Step 7) yields a set of $\lceil N/M \rceil$ candidate positions that includes $m$.

4. Best matching position choice: Based on the Chinese Remainder Theorem, we can further identify the best template matching result as the unique position $\hat{m}$ (Steps 8-9).

Algorithm 1 Fast Template Matching

Input: $x \in \mathbb{R}^N$, $t \in \mathbb{R}^K$; Output: $\hat{m}$;
01. function FTM()
02.   Pick two co-prime integers $M_1$ and $M_2$ as downsampling factors such that $M_1 M_2 > N$, $M_1 > K$, and $M_2 > K$;
03.   Downsample $x$ into $y_{M_1}$ and $y_{M_2}$, where
        $(y_{M_1})_k = \sum_{i=0}^{\lceil N/M_1 \rceil - 1} (x)_{k+iM_1}$ for $k = 0, \ldots, M_1 + K - 1$,
        $(y_{M_2})_k = \sum_{i=0}^{\lceil N/M_2 \rceil - 1} (x)_{k+iM_2}$ for $k = 0, \ldots, M_2 + K - 1$;
04.   Assign $t_{M_1} = [t\ 0_{1 \times M_1}]$ and $t_{M_2} = [t\ 0_{1 \times M_2}]$;
05.   $\hat{y}_{M_1} = \mathrm{FFT}(y_{M_1})$ and $\hat{y}_{M_2} = \mathrm{FFT}(y_{M_2})$;
      $\hat{t}_{M_1} = \mathrm{FFT}(t_{M_1})$ and $\hat{t}_{M_2} = \mathrm{FFT}(t_{M_2})$;
06.   $r_{M_1} = \mathrm{IFFT}(\hat{t}_{M_1} .* \hat{y}_{M_1})$, $r_{M_2} = \mathrm{IFFT}(\hat{t}_{M_2} .* \hat{y}_{M_2})$,
      where $.*$ denotes element-wise multiplication;
07.   $m_1 = \arg\max_{i < M_1} (r_{M_1})_i$, $m_2 = \arg\max_{i < M_2} (r_{M_2})_i$;
08.   $S_{m_1} = \{k \mid k = m_1 + i M_1 \text{ for } i = 0, \ldots, \lceil N/M_1 \rceil - 1\}$,
      $S_{m_2} = \{k \mid k = m_2 + i M_2 \text{ for } i = 0, \ldots, \lceil N/M_2 \rceil - 1\}$;
09.   $\hat{m} = S_{m_1} \cap S_{m_2}$;
10. end function

The main operations are discussed in detail as follows. 
We will use M to denote the downsampling factor if there is 
no confusion. 
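The four operations can be sketched end to end in a few lines. This is a hedged illustration rather than the authors' implementation: it replaces the FFT steps with a direct $O(MK)$ circular correlation so that it runs on the standard library alone, it folds $x$ to length $M$ (zero-padding is implicit) instead of length $M + K$, and the names `fold`, `circular_correlate`, and `ftm` as well as the test sizes are assumptions.

```python
import random


def fold(x, m):
    """Downsampling: add up the entries of x whose indices agree modulo m."""
    y = [0] * m
    for idx, value in enumerate(x):
        y[idx % m] += value
    return y


def circular_correlate(t, y):
    """r[k] = sum_i t[i] * y[(k + i) mod M]; direct stand-in for the FFT steps."""
    m = len(y)
    return [sum(ti * y[(k + i) % m] for i, ti in enumerate(t)) for k in range(m)]


def ftm(x, t, m1, m2):
    """Return the estimated position, or None if the candidate sets are disjoint."""
    n = len(x)
    residues = []
    for m in (m1, m2):
        r = circular_correlate(t, fold(x, m))              # low-dimensional matching
        residues.append(max(range(m), key=r.__getitem__))  # best residue mod m
    s1 = set(range(residues[0], n, m1))                    # candidate positions
    s2 = set(range(residues[1], n, m2))
    common = s1 & s2                                       # CRT: at most one element
    return min(common) if common else None


rng = random.Random(0)
x = [rng.choice((1, -1)) for _ in range(600)]
true_m = 337
t = x[true_m:true_m + 200]
estimate = ftm(x, t, 200, 201)   # co-prime factors with 200 * 201 > 600
```

With these sizes each folded entry sums only three correlation values, so the peak of height $K = 200$ stands far above the background and the estimate agrees with the true position.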

3.1. Downsampling 

Let $y$ be a downsampled signal defined as

$(y)_k = \sum_{i=0}^{\lceil N/M \rceil - 1} (x)_{k+iM} = (\Phi x)_k$    (2)

with

$\Phi = \mathcal{D}_M(\mathrm{circ}(r))$,    (3)

where $\mathcal{D}_M(\cdot)$ is the function that outputs the first $M$ rows of its argument and $r$ is generated as follows: $(r)_k = 1$ if $k \bmod M = 0$ and $(r)_k = 0$ otherwise, for $0 \le k \le N - 1$.

There are two useful properties of $\Phi$: 1) computing $\Phi x$ involves only additions, and 2) $\Phi$ is a partial circulant matrix. Note that $M$ controls the dimension of the downsampled signal. Furthermore, the proposed method requires $M > K$; otherwise, no matrix $\hat{T}$ satisfies $\hat{T}y = \Phi T x$ (details are discussed in Theorem 1 and Theorem 2).
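A small sketch of Eqs. (2) and (3) may help. It assumes that $M$ divides $N$ and builds $\Phi$ explicitly only to confirm that it matches the additions-only formula; in practice the matrix would never be formed. The function names are illustrative.

```python
from math import ceil


def downsample(x, m):
    """Eq. (2): y[k] = sum_i x[k + i*m], using additions only."""
    n = len(x)
    return [sum(x[k + i * m] for i in range(ceil(n / m)) if k + i * m < n)
            for k in range(m)]


def sampling_matrix(n, m):
    """Eq. (3): first m rows of circ(r), with r[k] = 1 when k % m == 0."""
    r = [1 if k % m == 0 else 0 for k in range(n)]
    return [[r[(j - k) % n] for j in range(n)] for k in range(m)]


x = [1, 2, 3, 4, 5, 6]
y = downsample(x, 3)                      # [1 + 4, 2 + 5, 3 + 6]
phi = sampling_matrix(len(x), 3)
phi_x = [sum(p * v for p, v in zip(row, x)) for row in phi]
```

Because every entry of $\Phi$ is 0 or 1, the product $\Phi x$ reduces to the running sums computed by `downsample`.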









3.2. Fast convolution in low-dimensional space

After downsampling, we seek a matrix $\hat{T} \in \mathbb{R}^{M \times M}$ such that $\hat{T}\Phi x = \Phi T x$. If this is feasible, it reduces the computation cost since the dimension of $\hat{T}$ is smaller than that of $T$. However, the problem is not trivial, because $\hat{T}$, $\Phi$, and $T$ have different dimensions and the commutative property of square matrices does not apply directly. Valsesia and Magli [17] propose Theorem 1, which states sufficient conditions under which the commutative property holds.


Theorem 1. (slightly revised from [17]) Let $T = \mathrm{circ}([t\ 0_{1 \times (N-K)}]) \in \mathbb{R}^{N \times N}$ be a circulant matrix, where $t$ is a $1 \times K$ ($K < M$) non-zero vector. Let $\hat{T} = \mathrm{circ}([t\ 0_{1 \times (M-K)}]) \in \mathbb{R}^{M \times M}$, let $\Phi$ be an $M \times N$ partial circulant matrix, and let $y = \Phi x$ and $\Phi T x$ denote the measurements of $x$ and the measurements of the filtered signal $Tx$, respectively. Then,

$(\hat{T}y)_i = (\Phi T x)_i$ if and only if $i \in [0, M - K]$.

In fact, Theorem 1 also implies that there are about $K$ mismatches, namely $(\hat{T}y)_i \ne (\Phi T x)_i$ for $M - K < i \le M - 1$. These mismatches may cause the proposed algorithm to fail. For example, $(\hat{T}y)_k$ is, in fact, the sum of $\lceil N/M \rceil$ entries of $Tx$. If $k$ is the position whose sum involves the best match $(Tx)_m$, and $(\hat{T}y)_k = (\Phi T x)_k$ (without mismatch), then $(\hat{T}y)_k = (Tx)_m + \eta$, where $\eta = \sum_{i:\, k+iM \ne m} (Tx)_{k+iM}$ and $m = k + iM$ for some $i$. We can anticipate that $(\hat{T}y)_k$ is large since it involves $(Tx)_m$. On the contrary, if $(\hat{T}y)_k \ne (\Phi T x)_k$ (with mismatch), $(\hat{T}y)_k$ is unpredictable since $(\hat{T}y)_k = (Tx)_m + \eta$ no longer holds. Under this circumstance, the proposed algorithm loses the information about $(Tx)_m$.

To deal with this problem, we consider two strategies: 1) Instead of using $\mathcal{D}_M$, we use $\mathcal{D}_{M+K}$ in Eq. (3) to produce $\Phi$. Then, according to Theorem 1, $(\hat{T}y)_k = (\Phi T x)_k$ holds for $0 \le k \le M - 1$. Note that the set of candidate positions collected from $(\hat{T}y)_k$ for $0 \le k \le M - 1$ already includes the information of $(Tx)_0, \ldots, (Tx)_{N-1}$. Thus, no information is lost. However, this incurs more overhead since the size of $y$ increases to $M + K$. 2) To further reduce this overhead, we derive another sufficient condition without mismatch by designing a new sampling matrix $\Phi$.

Theorem 2. Suppose $N/M$ is an integer. Let $T = \mathrm{circ}([t\ 0_{1 \times (N-M)}]) \in \mathbb{R}^{N \times N}$ be a circulant matrix, where $t$ is a $1 \times M$ non-zero vector, and let $\hat{T} = \mathrm{circ}(t) \in \mathbb{R}^{M \times M}$. We also let $\Phi = [\Phi_M\ \Phi_M\ \cdots\ \Phi_M]$ ($N/M$ copies) be an $M \times N$ matrix, where $\Phi_M$ is an $M \times M$ circulant matrix. Let $y = \Phi x$ be the measurements of $x$ and $\Phi T x$ the measurements of the filtered signal $Tx$. Then,

$(\hat{T}y)_i = (\Phi T x)_i$ if and only if $i \in [0, M - 1]$.

Proof. If both $A \in \mathbb{R}^{M \times M}$ and $B \in \mathbb{R}^{M \times M}$ are circulant matrices, then their multiplication is commutative, namely $AB = BA$. In our case,

$\hat{T}y = \hat{T}\Phi x = \hat{T}[\Phi_M\ \Phi_M\ \cdots\ \Phi_M]x = [(\hat{T}\Phi_M)\ (\hat{T}\Phi_M)\ \cdots\ (\hat{T}\Phi_M)]x$.

Since both $\Phi_M$ and $\hat{T}$ are circulant matrices, $\hat{T}\Phi_M = \Phi_M\hat{T}$. Thus,

$\hat{T}\Phi x = [(\Phi_M\hat{T})\ (\Phi_M\hat{T})\ \cdots\ (\Phi_M\hat{T})]x = \Phi\,\mathrm{diag}(\hat{T}, \hat{T}, \ldots, \hat{T})\,x = \Phi T x$. □

Theorem 2 holds when $M$ divides $N$. Otherwise, we can pad zeros at the tail of $x$ until $M$ divides $N$.
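The commutation property behind Theorem 2 can be checked numerically on a toy case. The sketch below assumes the simplest choice $\Phi_M = I$, so that $\Phi$ merely adds up the $N/M$ consecutive blocks of its input, and it assumes $M$ divides $N$; the data and helper names are illustrative.

```python
def fold(v, m):
    """Apply Phi = [I I ... I]: add up the entries whose indices agree modulo m."""
    y = [0] * m
    for idx, value in enumerate(v):
        y[idx % m] += value
    return y


def circular_correlate(t, s):
    """(circ([t 0]) s)[k] = sum_i t[i] * s[(k + i) mod len(s)]."""
    n = len(s)
    return [sum(ti * s[(k + i) % n] for i, ti in enumerate(t)) for k in range(n)]


x = [3, -1, 4, 1, -5, 9, 2, -6, 5, 3, -5, 8]   # N = 12
t = [1, -2, 3]                                 # K = 3 <= M
M = 4                                          # M divides N

small = circular_correlate(t, fold(x, M))      # T_hat y: correlate after folding
large = fold(circular_correlate(t, x), M)      # Phi T x: fold after correlating
```

Both orders give the same length-$M$ vector, but the first one only ever touches $M$ samples once the fold is done.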

3.3. Matching position finding 

Let $\hat{y}$ be the convolved signal with $\hat{y} = \hat{T}y$. By Theorem 1 or Theorem 2, $\hat{y}$ can be rewritten as

$(\hat{y})_k = \sum_{i=0}^{\lceil N/M \rceil - 1} (Tx)_{k+iM}$.    (4)

If $\hat{m} = \arg\max_{k < M} (\hat{y})_k$, we are interested in whether the ground truth $m$ belongs to the candidate position set $C = \{k \mid k = \hat{m} + iM \text{ for } i = 0, \ldots, \lceil N/M \rceil - 1\}$. If so, the proposed algorithm correctly finds a set $C$ that includes $m$. We examine a sufficient condition for a successful trial in Theorem 3.

Theorem 3. Let $(Tx)_k = \sum_{i=0}^{K-1} (t)_i (x)_{k+i}$. Let the desired matching result be $m = \arg\max_k (Tx)_k$. If $\frac{1}{2\lceil N/M \rceil + 1}(Tx)_m > \max_{k \ne m} |(Tx)_k|$, then the proposed algorithm determines $\hat{m}$ such that $m \in C$.

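Proof. Without loss of generality, we assume $m = 0$, so that a correct result is $\hat{m} = 0$ with $m \in \{k \mid k = iM \text{ for } i = 0, \ldots, \lceil N/M \rceil - 1\}$. If $(\hat{y})_0 > \max_{i \ne 0} (\hat{y})_i$, the proposed algorithm correctly picks $\hat{m} = 0$.

To show a sufficient condition for $(\hat{y})_0 > \max_{i \ne 0} (\hat{y})_i$, we first derive a lower bound of $(\hat{y})_0$:

$(\hat{y})_0 = (Tx)_0 + \sum_{i=1}^{\lceil N/M \rceil - 1} (Tx)_{iM} \ge (Tx)_0 - (\lceil N/M \rceil - 1) \max_{k \ne 0} |(Tx)_k|$.

Then, an upper bound of $\max_{i \ne 0} (\hat{y})_i$ is $\max_{i \ne 0} (\hat{y})_i \le \lceil N/M \rceil \max_{k \ne 0} |(Tx)_k|$. Thus, it suffices that

$(Tx)_0 - (\lceil N/M \rceil - 1) \max_{k \ne 0} |(Tx)_k| > \lceil N/M \rceil \max_{k \ne 0} |(Tx)_k|$
$\Leftarrow \frac{1}{2\lceil N/M \rceil + 1}(Tx)_0 > \max_{k \ne 0} |(Tx)_k|$.

We complete the proof. □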

Although Theorem 3 provides a sufficient condition, the success probability is still unknown. In fact, the probability depends on the signal type of $x$. We give a more detailed analysis later, taking a practical application as an example.
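The sufficient condition can be tested on a hand-made vector of correlation values. The sketch assumes that the condition compares $(Tx)_m / (2\lceil N/M \rceil + 1)$ with the largest absolute value among the other entries; the score vector and the function names are made up for illustration.

```python
from math import ceil


def sufficient_condition_holds(scores, m_factor):
    """Check (Tx)_m / (2*ceil(N/M) + 1) > max_{k != m} |(Tx)_k|."""
    n = len(scores)
    m = max(range(n), key=scores.__getitem__)
    rival = max(abs(s) for k, s in enumerate(scores) if k != m)
    return scores[m] / (2 * ceil(n / m_factor) + 1) > rival


def best_residue(scores, m_factor):
    """Fold the scores as in Eq. (4) and return the index of the largest sum."""
    folded = [0] * m_factor
    for k, s in enumerate(scores):
        folded[k % m_factor] += s
    return max(range(m_factor), key=folded.__getitem__)


# Hypothetical correlation values (Tx)_k with a clear peak at k = 3.
scores = [0, 1, -1, 30, 2, 0, -2, 1, 1, 0, -1, 2]
M = 4
condition = sufficient_condition_holds(scores, M)   # 30 / 7 > 2
residue = best_residue(scores, M)                   # folded sums: 3, 1, -4, 33
```

Here the peak survives the folding, so the candidate set $\{3, 7, 11\}$ contains the true position 3.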

3.4. Best matching position 

The operation “Matching position finding” yields a set of candidate positions, but the unique position is still unknown. The following discussion assumes that the set of candidate positions includes the correct solution $m$.

Identifying the correct solution is equivalent to solving for an unknown variable $m$ such that $\hat{m} = m \bmod M$. The problem is solved efficiently by the Chinese Remainder Theorem as follows.

Theorem 4. (Chinese Remainder Theorem (CRT)) Any integer $m$ is uniquely specified mod $N$ by its remainders modulo $a$ pairwise relatively prime integers $M_1, M_2, \ldots, M_a$ as long as $\prod_{i=1}^{a} M_i > N$.

More specifically, we consider two co-prime integers in this paper. Let $M_1$ and $M_2$ be relatively prime integers such that $M_1 M_2 > N$. Hence, the two equations $m_1 = m \bmod M_1$ and $m_2 = m \bmod M_2$, where $m_1$ and $m_2$ are the results obtained from the third operation “Matching position finding,” have the unique solution $m$, which is the best matching position.
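For two co-prime moduli the position can also be recovered in closed form instead of intersecting candidate sets. The following sketch uses the modular inverse provided by Python's built-in `pow` (three-argument form with exponent -1, available from Python 3.8); the function name `crt_pair` and the moduli 200 and 201 are illustrative assumptions.

```python
def crt_pair(m1, mod1, m2, mod2):
    """Return the unique m in [0, mod1*mod2) with m % mod1 == m1 and m % mod2 == m2.

    Requires gcd(mod1, mod2) == 1; pow raises ValueError otherwise.
    """
    inverse = pow(mod1, -1, mod2)               # mod1^{-1} modulo mod2
    step = ((m2 - m1) * inverse) % mod2         # how many multiples of mod1 to add
    return m1 + mod1 * step


# Residues of the position 337 for the co-prime pair (200, 201).
recovered = crt_pair(337 % 200, 200, 337 % 201, 201)
```

Since $M_1 M_2 > N$, the value returned for the true residues is the true position itself, and the closed form avoids building the two candidate sets.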

3.5. Practical Applications 

In this section, we discuss practical applications of the proposed algorithm. Among them, synchronization is a critical issue in communications. For example, the global positioning system (GPS) spends 30%-75% of its power on synchronization. Thus, it is crucial to develop cost-effective GPS synchronization. The problem is defined as follows.

• The sender transmits a spreading code $x \in \{1, -1\}^N$, which is also known by the receiver.

• The receiver obtains the delayed code $x^m = [(x)_m, \ldots, (x)_{N-1}, (x)_0, \ldots, (x)_{m-1}]$.

• By comparing the spreading code ($t = x$ in this paper) and the delayed code, the receiver solves $m = \arg\max_{0 \le k < N} \mathrm{Sim}(x^k, t)$.

Note that the computation complexity of fast convolution by FFT is invariant to the size of $t$. In other words, even if the template is smaller than $N$, the computation cost is not reduced.

In contrast, the proposed method does not need to set $t = x$ for synchronization and can benefit from a low-dimensional template. In fact, it suffices to set $t = [(x)_0, \ldots, (x)_{K-1}]$ with size $K < M < N$. The following theorem gives the success probability and computation complexity of our method.

Theorem 5. Let $\alpha = \min(M_1, M_2)$ and let $\beta = \max(M_1, M_2)$. If $x \in \{1, -1\}^N$, where $1$ and $-1$ have equal probability, and $t = [(x)_m, \ldots, (x)_{m+K-1}]$ for any $m$, then our proposed algorithm identifies a matching result $\hat{m}$ equal to the correct position $m$, i.e., $\hat{m} = m$, with success probability larger than $1 - 2\left(O\left(\alpha e^{-K/(8\lceil N/\alpha \rceil)}\right) + O\left(\lceil N/\alpha \rceil / K\right)\right)$, and with computation costs of $O(N + \beta \log \beta)$ additions and $O(\beta \log \beta)$ multiplications.


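Proof. The proposed algorithm can be roughly divided into two matching problems based on $M_1$ and $M_2$. The proposed method is considered successful only when both matching problems succeed. In the following, we take the matching problem based on $M_1$ as an example; the same analysis applies to the matching problem involving $M_2$.

The probability that the matching problem employing $M_1$ reports an incorrect result is

$P_F^{M_1} = \Pr\left[(\hat{T}y)_m \le \max_{i \ne m} (\hat{T}y)_i\right]$,

where the index $m$ of $\hat{T}y$ is understood modulo $M_1$.

Since $(Tx)_k = \sum_{i=0}^{K-1} (t)_i (x)_{k+i}$, we have

• If $k = m$, $(Tx)_m = \sum_{i=0}^{K-1} (x)_{m+i} (x)_{m+i} = K$. Furthermore, $E[(Tx)_m] = K$ and $\mathrm{Var}[(Tx)_m] = 0$.

• If $k \ne m$, $E[(Tx)_k] = \sum_{i=0}^{K-1} E[(t)_i (x)_{k+i}] = 0$ due to the independence between $(t)_i$ and $(x)_{k+i}$. Furthermore, $(t)_i (x)_{k+i}$ and $(t)_j (x)_{k+j}$ are independent for $i \ne j$. Thus, $\mathrm{Var}[(Tx)_k] = \sum_{i=0}^{K-1} \mathrm{Var}[(t)_i (x)_{k+i}] = K$.

Recall that $(\hat{T}y)_k = \sum_{i=0}^{\lceil N/M_1 \rceil - 1} (Tx)_{k+iM_1}$. Let $S_k = \{j \mid j = k + iM_1 \text{ for } i = 0, \ldots, \lceil N/M_1 \rceil - 1\}$. Thus,

• If $m \in S_k$, $E[(\hat{T}y)_k] = \sum_{i=0}^{\lceil N/M_1 \rceil - 1} E[(Tx)_{k+iM_1}] = K$. Furthermore, $E[(Tx)_{k+iM_1} (Tx)_{k+jM_1}] = 0$ for $i \ne j$. It implies $\mathrm{Cov}[(Tx)_{k+iM_1}, (Tx)_{k+jM_1}] = 0$ for $i \ne j$. Thus, $\mathrm{Var}[(\hat{T}y)_k] = \sum_{i=0}^{\lceil N/M_1 \rceil - 1} \mathrm{Var}[(Tx)_{k+iM_1}] = (\lceil N/M_1 \rceil - 1)K$.

• If $m \notin S_k$, $E[(\hat{T}y)_k] = 0$ and $\mathrm{Var}[(\hat{T}y)_k] = \lceil N/M_1 \rceil K$.

Then, we bound the probability by the following events: $E_1$: $\exists k$ with $m \notin S_k$ such that $(\hat{T}y)_k > K/2$; $E_2$: $\exists k$ with $m \in S_k$ such that $(\hat{T}y)_k < K/2$. If neither event holds, the algorithm output is correct. In other words,

$P_F^{M_1} \le \Pr[E_1] + \Pr[E_2]$.

First, we discuss $\Pr[E_1]$. We start from the probability that $(\hat{T}y)_k > K/2$ for a single $k$ such that $m \notin S_k$. Note that, in this case, $(\hat{T}y)_k$ is considered as a sum of $\lceil N/M_1 \rceil K$ independent random variables taking values in $\{1, -1\}$ with probability $1/2$. Thus, for each $k$, the Chernoff bound gives

$\Pr\left[(\hat{T}y)_k > \frac{K}{2}\right] \le e^{-\frac{(K/2)^2}{2\lceil N/M_1 \rceil K}} = e^{-\frac{K}{8\lceil N/M_1 \rceil}}$.

Since the cardinality of $\{k \mid m \notin S_k\}$ is at most $M_1 - 1$,

$\Pr[E_1] \le (M_1 - 1) e^{-\frac{K}{8\lceil N/M_1 \rceil}} < M_1 e^{-\frac{K}{8\lceil N/M_1 \rceil}}$.

Second, $\Pr[E_2]$ is bounded by using Chebyshev's inequality. For the $k$ with $m \in S_k$,

$\Pr[E_2] = \Pr\left[(\hat{T}y)_k < \frac{K}{2}\right] = \Pr\left[(\hat{T}y)_k < K - \frac{K}{2}\right] = \Pr\left[(\hat{T}y)_k - K < -\frac{K}{2}\right] = \Pr\left[K - (\hat{T}y)_k > \frac{K}{2}\right] \le \frac{(\lceil N/M_1 \rceil - 1)K}{(K/2)^2} = \frac{4(\lceil N/M_1 \rceil - 1)}{K}$.

Consequently, $P_F^{M_1} \le \Pr[E_1] + \Pr[E_2] < M_1 e^{-\frac{K}{8\lceil N/M_1 \rceil}} + \frac{4(\lceil N/M_1 \rceil - 1)}{K}$.

Similarly, the above derivations apply to the second matching problem based on $M_2$, which results in $P_F^{M_2} < M_2 e^{-\frac{K}{8\lceil N/M_2 \rceil}} + \frac{4(\lceil N/M_2 \rceil - 1)}{K}$.

In sum, the proposed algorithm succeeds with probability larger than $(1 - P_F^{M_1})(1 - P_F^{M_2})$. If $M_1 > M_2$, then $P_F^{M_1} < P_F^{M_2}$. For simplification, let $\alpha = \min(M_1, M_2)$; then $(1 - P_F^{M_1})(1 - P_F^{M_2}) > (1 - P_F^{\alpha})^2 > 1 - 2\left(O\left(\alpha e^{-K/(8\lceil N/\alpha \rceil)}\right) + O\left(\lceil N/\alpha \rceil / K\right)\right)$. We complete the proof regarding the success probability.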

To prove the computation complexity, we check Algorithm 1 step by step. Step 2 depends on how the co-prime integers are picked. In our case, they are assigned by setting $M_1 = K$ and $M_2 = K + 1$, so the cost is negligible. Step 3 costs $O(N)$ additions. Step 4 assigns vectors whose sizes are on the order of $M_1$ or $M_2$, and the cost is also negligible. Steps 5 and 6 perform FFTs and cost $O(M_1 \log M_1) + O(M_2 \log M_2)$. Step 7 searches for the maximum value within a vector of size $M_1$ or $M_2$, and the cost is negligible. Furthermore, in Steps 8 and 9, the cardinalities of $S_{m_1}$ and $S_{m_2}$ are $\lceil N/M_1 \rceil$ and $\lceil N/M_2 \rceil$, respectively, both of which are smaller than $O(N)$.

Consequently, the computation cost is dominated by Step 3 and Steps 5 and 6, with a total cost of $O(N + M_1 \log M_1 + M_2 \log M_2)$ additions and $O(M_1 \log M_1 + M_2 \log M_2)$ multiplications. For simplification, letting $\beta = \max(M_1, M_2)$, the cost is bounded by $O(N + \beta \log \beta)$ additions and $O(\beta \log \beta)$ multiplications. We complete the proof. □


4. SIMULATION RESULTS 

The simulations were conducted in a MATLAB R2012b environment with an Intel CPU Q6600 and 16 GB RAM under Microsoft Windows 7 (64 bits). We compare the proposed algorithm, implemented in C, with fast convolution by FFT and sparse FFT [1]. Another goal is to verify our theoretical analyses.

The testing procedure is:

1. Generate $x \in \{1, -1\}^N$ and extract the template from $x$ as $t = [(x)_m, \ldots, (x)_{m+K-1}]$.

2. Input $x$ and $t$ into Algorithm 1 with $M_1 = K$ and $M_2 = K + 1$, and output $\hat{m}$.

3. If $\hat{m} = m$, the trial is successful. Otherwise, it fails.

We repeat this procedure 1000 times and calculate the success probability.
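The testing procedure can be mimicked at a much smaller scale with the sketch below. It is not the authors' MATLAB/C setup: the sizes ($N = 300$, $K = 100$), the trial count, the seed, and the direct correlation used in place of the FFT are all assumptions chosen so that the example runs quickly on the standard library.

```python
import random


def fold(x, m):
    """Downsample x by adding up entries whose indices agree modulo m."""
    y = [0] * m
    for idx, value in enumerate(x):
        y[idx % m] += value
    return y


def circular_correlate(t, y):
    """r[k] = sum_i t[i] * y[(k + i) mod M]."""
    m = len(y)
    return [sum(ti * y[(k + i) % m] for i, ti in enumerate(t)) for k in range(m)]


def run_trial(rng, n, k, m1, m2):
    """One trial: random +/-1 signal, template cut at a random position m."""
    x = [rng.choice((1, -1)) for _ in range(n)]
    m = rng.randrange(n - k + 1)
    t = x[m:m + k]
    hits = []
    for mod in (m1, m2):
        r = circular_correlate(t, fold(x, mod))
        hits.append(max(range(mod), key=r.__getitem__))
    candidates = set(range(hits[0], n, m1)) & set(range(hits[1], n, m2))
    return candidates == {m}          # success only if the unique candidate is m


def success_rate(trials, n, k, seed=0):
    """Fraction of successful trials with M1 = K and M2 = K + 1."""
    rng = random.Random(seed)
    return sum(run_trial(rng, n, k, k, k + 1) for _ in range(trials)) / trials


rate = success_rate(trials=20, n=300, k=100)
```

Shrinking $K$ relative to $N$ in `success_rate` lowers the measured rate, which is the qualitative behaviour that Theorem 5 describes.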

In Fig. 1, we verify the computation cost derived in Theorem 5. Fast convolution by FFT is considered as the baseline. Note that all the results shown in Fig. 1 have success probabilities fixed at 100%. In Fig. 1(a), the results for our method are shown by fixing $N$ at 2^® with $K$ being increased from 2^® to 2^®. However, the complexities of the FFT-based and sFFT-based [1] methods depend only on $N$. Thus, their computation costs are invariant to $K$. On the other hand, the proposed algorithm benefits from a smaller template. One can observe that the proposed method outperforms the FFT-based and sFFT-based methods when ^ > 2^ and ^ > 2^, respectively, implying that our algorithm is efficient when the template size is (far) smaller than the corresponding signal size. Furthermore, when K < 2^^, the computation cost is, in fact, dominated by $O(N)$ instead of $O(M \log M)$ and increases slowly. In contrast, when $O(M \log M)$ dominates, the computation cost roughly doubles when $K$ is doubled.

In Fig. 1(b), the results for our method are shown by fixing $K$ with $N$ being increased from 2^® to 2^®. In addition, the results for the so-called “Proposed Method (best)” are shown, where $N$ is still increased from 2^® to 2^® but, for each $N$, $K$ is chosen as small as possible such that the success probability reaches 100% based on Theorem 5. Overall, these results indicate that (1) our method outperforms the FFT-based and sFFT-based methods, especially when $N$ is large, implying that it is more efficient for large-scale problems; and (2) when N < 2^^, small templates (dotted curve) suffice to quickly achieve 100% successful template matching. Similarly to Fig. 1(a), when N > 2^^, $O(N)$ overwhelms $O(M \log M)$. In this case, the slopes of the red solid and dotted curves (our methods) approximate those of the solid black (FFTW) and solid blue (sFFT) curves, whose complexities are $O(N)$ and $O(N \log N)$, respectively.



[Figure 1, panel (a): horizontal axis $\log_2 K$ with ticks from 16 to 25; plots not reproduced.]

Fig. 1. (a) The computation cost versus different $K$'s under N = 2^®. (b) The computation cost versus different $N$'s under K = 2^®. All comparisons were conducted with success probabilities fixed at 100%.


Table 1 further verifies the theoretical probability derived in Theorem 5. By Theorem 5, the proposed algorithm succeeds with probability larger than $1 - 2\left(O\left(\alpha e^{-K/(8\lceil N/\alpha \rceil)}\right) + O\left(\lceil N/\alpha \rceil / K\right)\right)$. In our case, $\alpha = \min(M_1, M_2)$, where $M_1$ and $M_2$ were, in fact, set to $K$ and $K + 1$, respectively. Thus, when the condition $K^2 > O(N)$ holds, $\lceil N/\alpha \rceil / K \to 0$. Similarly, we have $\alpha e^{-K/(8\lceil N/\alpha \rceil)} \to 0$ with $K^2/\log K > O(N)$. In this experiment, ^ was fixed as 2^®. The results show that along with the increase of $N$, the condition = 220 i,^jv-c - ® positive constant), which is equivalent to > 0(2^®), holds. Under the condition, Algorithm 1 succeeds with high probability, as depicted in Table 1.

Table 1. Success probabilities under AT =

N               2^®     2^      22b     22®     2^
Success Prob.   0.05    0.21    0.85    1       1

Finally, we test whether the proposed method is robust to noise interference. In this case, let $x_e = x + e$, where $e \in \mathbb{R}^N$ is additive Gaussian random noise. Both $x_e$ and $t$ were fed into Algorithm 1. The parameters, N = 2^® and M = 2^®, were chosen because the corresponding success probability lies on the edge between 100% and below 100%, and is therefore expected to be visibly affected by noise. One can observe from Table 2 that our method works well when SNRs are larger than 1 dB.

Table 2. Success probabilities under N = 2^®, K = 2^®, and different SNRs.

SNR (dB)        20      10      6       1       -2      -6
Success Prob.   1       1       0.95    0.72    0.48    0.06

5. CONCLUSIONS AND FUTURE WORK

We present a fast and cost-effective template matching scheme in this paper. We exploit the commutative property of partial circulant matrices to design the sensing matrix for template matching. Our theoretical analyses and simulation results show that the proposed method outperforms FFT- and sparse FFT-based methods. Future work will examine fast template matching with similarity measures other than cross-correlation.

6. ACKNOWLEDGMENT

This work was supported by the Ministry of Science and Technology, Taiwan (ROC), under grants MOST 104-2221-E-001-019-MY3 and 104-2221-E-001-030-MY3.











































7. REFERENCES 

[1] H. Hassanieh, P. Indyk, D. Katabi, and E. Price, “Faster GPS via the sparse Fourier transform,” in ACM MOBICOM, 2012.

[2] M. S. Aksoy, O. Torkul, and I. H. Cedimoglu, “An industrial visual inspection system that uses inductive learning,” Journal of Intelligent Manufacturing, vol. 15, pp. 569-574, 2004.

[3] T. Luczak and W. Szpankowski, “A suboptimal lossy data compression based on approximate pattern matching,” IEEE Transactions on Information Theory, vol. 43, pp. 1439-1451, 1997.

[4] R. M. Dufour, E. L. Miller, and N. P. Galatsanos, “Template matching based object recognition with unknown geometric parameters,” IEEE Transactions on Image Processing, vol. 11, pp. 1385-1396, 2002.

[5] P. E. Anuta, “Spatial registration of multispectral and multitemporal digital imagery using fast Fourier transform,” IEEE Transactions on Geoscience Electronics, vol. 8, pp. 353-368, 1970.

[6] J. P. Lewis, “Fast template matching,” in Proc. Vision Interface, 1995, pp. 120-123.

[7] S.-D. Wei and S.-H. Lai, “Fast template matching based on normalized cross correlation with adaptive multilevel winner update,” IEEE Transactions on Image Processing, vol. 17, pp. 2227-2235, 2008.

[8] W. H. Pan, S. D. Wei, and S. H. Lai, “Efficient NCC-based image matching in Walsh-Hadamard domain,” in Proc. European Conf. Computer Vision: Part III, 2008, pp. 468-480.

[9] S. Santini and R. Jain, “Similarity measures,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 21, pp. 871-883, 1999.

[10] G. J. VanderBrug and A. Rosenfeld, “Two-stage template matching,” IEEE Transactions on Computers, vol. 26, pp. 384-393, 1977.

[11] M. Gharavi-Alkhansari, “A fast globally optimal algorithm for template matching using low-resolution pruning,” IEEE Transactions on Image Processing, vol. 10, pp. 526-533, 2001.

[12] A. Mahmood and S. Khan, “Correlation-coefficient-based fast template matching through partial elimination,” IEEE Transactions on Image Processing, vol. 21, pp. 2099-2108, 2012.

[13] S. Mattoccia, F. Tombari, and L. D. Stefano, “Fast full-search equivalent template matching by enhanced bounded correlation,” IEEE Transactions on Image Processing, vol. 17, pp. 528-538, 2008.

[14] H. Schweitzer, R. Deng, and R. F. Anderson, “A dual-bound algorithm for very fast and exact template matching,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 33, pp. 459-470, 2011.

[15] W. Ouyang, F. Tombari, S. Mattoccia, L. D. Stefano, and W.-K. Cham, “Performance evaluation of full search equivalent pattern matching algorithms,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 34, pp. 127-143, 2012.

[16] H. Hassanieh, P. Indyk, D. Katabi, and E. Price, “Nearly optimal sparse Fourier transform,” in STOC, 2012.

[17] D. Valsesia and E. Magli, “Compressive signal processing with circulant sensing matrices,” in IEEE International Conference on Acoustics, Speech and Signal Processing, 2014, pp. 1015-1019.



